NLP-NG - A New NLP System for Biomedical Text Analysis

نویسندگان

  • Robert P. Futrelle
  • Jeff Satterley
  • Tim McCormack
چکیده

NLP-NG is a new NLP system consisting of three components: NG-CORE (language processing), NG-DB (database management), and NG-SEE (interactive visualization and entry). The ultimate goal of NLP-NG is to produce information retrieval systems in which users can choose full-text schema, adding specific items to focus their queries. Schema are created by a normalization process which elides adjunctive constructions as well as replacing items by prototypes. Biomedical text contains domain-specific constructions which are revealed by normalization. NLP-NG is based on Construction Grammar. Computationally, all representations are integer-based, allowing efficient storage, indexing, and retrieval. SEE, an Ajax web browser client, allows developers, linguists, and users to view a corpus and modify its properties. NLP-NG uses a 300 million word BioMed Central corpus. NLP-NG does not focus on specific strategies to extract limited classes of information from papers. Instead, it is a universal approach that can codify a wide variety of text in papers..

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model

Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...

متن کامل

An Efficient Coupled Genetic Algorithm-NLP Method for Heat Exchanger Network Synthesis

Synthesis of heat exchanger networks (HENs) is inherently a mixed integer and nonlinear programming (MINLP) problem. Solving such problems leads to difficulties <span style="font-size: 10pt; color: #00...

متن کامل

Current issues in biomedical text mining and natural language processing

The years since 1998 have seen an explosion in work in biomedical text mining (BioNLP) of both clinical text and the biomedical literature [1]. The work focusing on the literature has been particularly stimulated by three factors. One is simply the rapid increase in the rate of publication in general, as reflected in the growth in the contents of PubMed/MEDLINE, which has been exponential. Anot...

متن کامل

Natural Language Processing in Biomedicine: A Unified System Architecture Overview

In contemporary electronic medical records much of the clinically important data-signs and symptoms, symptom severity, disease status, etc.-are not provided in structured data fields but rather are encoded in clinician-generated narrative text. Natural language processing (NLP) provides a means of unlocking this important data source for applications in clinical decision support, quality assura...

متن کامل

Current issues in biomedical text mining and natural language processing

The years since 1998 have seen an explosion in work in biomedical text mining (BioNLP) of both clinical text and the biomedical literature [1]. The work focusing on the literature has been particularly stimulated by three factors. One is simply the rapid increase in the rate of publication in general, as reflected in the growth in the contents of PubMed/MEDLINE, which has been exponential. Anot...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009